Picture for Ge Zhang

Ge Zhang

TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection

Add code
May 31, 2026
Viaarxiv icon

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories

Add code
May 29, 2026
Viaarxiv icon

AbstainGNN: Teaching Graph Neural Networks to Abstain for Graph Classification

Add code
May 29, 2026
Viaarxiv icon

Do LLMs Know Tool Irrelevance? Demystifying Structural Alignment Bias in Tool Invocations

Add code
Apr 13, 2026
Viaarxiv icon

EA-Agent: A Structured Multi-Step Reasoning Agent for Entity Alignment

Add code
Apr 13, 2026
Viaarxiv icon

In-Place Test-Time Training

Add code
Apr 07, 2026
Viaarxiv icon

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

Add code
Mar 09, 2026
Viaarxiv icon

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Add code
Feb 26, 2026
Viaarxiv icon

WorldTravel: A Realistic Multimodal Travel-Planning Benchmark with Tightly Coupled Constraints

Add code
Feb 09, 2026
Viaarxiv icon

The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL

Add code
Feb 06, 2026
Viaarxiv icon